Home > Search results

Search results

Your search

kw.\*:("Reinforcement learning")

Filter

A-Z Z-A Frequency ↓ Frequency ↑

PASCAL (1084)
FRANCIS (126)

Export in CSV

Document Type [dt]

A-Z Z-A Frequency ↓ Frequency ↑

Article (591)
Conference Paper (473)
Book Chapter (13)
Conference Proceedings (4)
Thesis (3)
Serial Issue (2)
Book (1)

Export in CSV

Publication Year[py]

A-Z Z-A Frequency ↓ Frequency ↑

2004 (107)
2006 (106)
2013 (102)
2005 (94)
2014 (89)
2011 (68)
2010 (62)
2002 (55)
2009 (53)
2000 (51)
2003 (45)
2007 (44)
2012 (44)
2008 (43)
1998 (41)
2001 (40)
1999 (20)
1997 (12)
2015 (7)
1996 (4)

Export in CSV

Discipline (document) [di]

A-Z Z-A Frequency ↓ Frequency ↑

Computer science : theoretical automation and systems (840)
Telecommunications and information theory (102)
Psychology. Ethology (83)
Operational research. Management (72)
Mathematics (43)
Psychopathology. Psychiatry. Clinical psychology (38)
Electrical engineering. Electroenergetics (32)
Vertebrates : nervous system and sense organs (32)
Building. Public works. Transport. Civil engineering (24)
Biological sciences. Generalities. Modelling. Methods (18)
Generalities in biological sciences (18)
Mechanical engineering. Mechanical construction. Handling (13)
Economy. Legislation. Training. Society (12)
Energy (11)
Electronics (10)
Neurology (7)
Public health. Hygiene-occupational medicine (5)
Sciences of information and communication (5)
Linguistics (4)
Pharmacological treatments (4)
Theoretical physics (4)
Physics : solid mechanics (3)
Radiotherapy. Instrumental treatment. Physiotherapy. Reeducation. Rehabilitation, speech therapy, crenotherapy. Dietary management and various treatments (3)
Scanning and diagnostic techniques (3)
Earth sciences (2)
External geophysics (2)
Metrology (2)
Agrifood industries (1)
Agronomy. Soil sciences and vegetal productions (1)
Animal, vegetal and microbial ecology (1)
Biotechnology (1)
Generalities in medical sciences (1)
Gynecology. Andrology. Obstetrics (1)
Metabolic diseases (1)
Metals. Metallurgy (1)
Molecular biophysics (1)
Physics : acoustics (1)
Physics : optics (1)
Pollution (1)
Polymer industry, paints, wood (1)
Sociology (1)
Surgery. Transplants, organs and tissues grafting. Graft pathologies (1)
Vertebrates : general zoology, morphology, phylogeny, systematics, cytogenetics, geographical distribution (1)

Export in CSV

Language

A-Z Z-A Frequency ↓ Frequency ↑

English (1063)
French (24)

Export in CSV

Author Country

A-Z Z-A Frequency ↓ Frequency ↑

United States (385)
Japan (141)
United Kingdom (132)
France (89)
Germany (86)
China (84)
Canada (76)
Switzerland (43)
Italy (42)
Spain (41)
Australia (40)
Netherlands (36)
Korea, Republic of (32)
Iran, Islamic Republic of (27)
Belgium (26)
Greece (25)
Singapore (25)
India (24)
Brazil (22)
Hong-Kong (17)
Taiwan, Province of China (17)
Israel (16)
Finland (13)
Hungary (12)
Mexico (9)
New Zealand (9)
Turkey (9)
Poland (8)
Sweden (8)
Austria (7)
Portugal (6)
Saudi Arabia (5)
Croatia (4)
Denmark (4)
Ireland (4)
Argentina (3)
Cyprus (3)
Malaysia (3)
Norway (3)
Slovenia (3)
Thailand (3)
Czech Republic (2)
International (2)
Romania (2)
Serbia (2)
United Arab Emirates (2)
Algeria (1)
Bahrain (1)
Botswana (1)
Bulgaria (1)
Colombia (1)
Estonia (1)
Europe (1)
Iceland (1)
Indonesia (1)
Jordan (1)
Lebanon (1)
Morocco (1)
Pakistan (1)
Qatar (1)
Tunisia (1)
Venezuela (1)
Viet Nam (1)
Yugoslavia (1)

Export in CSV

Origin

A-Z Z-A Frequency ↓ Frequency ↑

Inist-CNRS (1087)

Export in CSV

Results 1 to 25 of 1087

Page / 44

Display by page

Sort by :

Export

Selection :

Selected items (0)
Items between and
All items

Format :

Reinforcement learning for discounted values often loses the goal in the application to animal learningYAMAGUCHI, Yoshiya; SAKAI, Yutaka.Neural networks. 2012, Vol 35, pp 88-91, issn 0893-6080, 4 p.Article

Adaptive game AI with dynamic scripting : Machine learning and gamesSPRONCK, Pieter; PONSEN, Marc; SPRINKHUIZEN-KUYPER, Ida et al.Machine learning. 2006, Vol 63, Num 3, pp 217-248, issn 0885-6125, 32 p.Article

DistanceRank : An intelligent ranking algorithm for web pagesALI MOHAMMAD ZAREH BIDOKI; YAZDANI, Nasser.Information processing & management. 2008, Vol 44, Num 2, pp 877-892, issn 0306-4573, 16 p.Article

Graph kernels and Gaussian processes for relational reinforcement learningDRIESSENS, Kurt; RAMON, Jan; GÄRTNER, Thomas et al.Machine learning. 2006, Vol 64, Num 1-3, pp 91-119, issn 0885-6125, 29 p.Conference Paper

Evidence for learning to learn behavior in normal form gamesSALMON, Timothy C.Theory and decision. 2004, Vol 56, Num 4, pp 367-404, issn 0040-5833, 38 p.Article

A Modified Memory-Based Reinforcement Learning Method for Solving POMDP ProblemsLEI ZHENG; CHO, Siu-Yeung.Neural processing letters. 2011, Vol 33, Num 2, pp 187-200, issn 1370-4621, 14 p.Article

The asymptotic equipartition property in reinforcement learning and its relation to return maximizationIWATA, Kazunori; IKEDA, Kazushi; SAKAI, Hideaki et al.Neural networks. 2006, Vol 19, Num 1, pp 62-75, issn 0893-6080, 14 p.Article

The first learning track of the international planning competitionFERN, Alan; KHARDON, Roni; TADEPALLI, Prasad et al.Machine learning. 2011, Vol 84, Num 1-2, pp 81-107, issn 0885-6125, 27 p.Article

Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning modelJOHNSON, Adam; REDISH, A. David.Neural networks. 2005, Vol 18, Num 9, pp 1163-1171, issn 0893-6080, 9 p.Article

Feedforward neural networks in reinforcement learning applied to high-dimensional motor controlCOULOM, Rémi.Lecture notes in computer science. 2002, pp 403-413, issn 0302-9743, isbn 3-540-00170-0, 11 p.Conference Paper

An Actor―Critic based controller for glucose regulation in type 1 diabetesDASKALAKI, Elena; DIEM, Peter; MOUGIAKAKOU, Stavroula G et al.Computer methods and programs in biomedicine (Print). 2013, Vol 109, Num 2, pp 116-125, issn 0169-2607, 10 p.Article

Economic impact assessment and operational decision making in emission and transmission constrained electricity markets : Smart GridsNANDURI, Vishnu; KAZEMZADEH, Narges.Applied energy. 2012, Vol 96, pp 212-221, issn 0306-2619, 10 p.Article

Embedding a priori knowledge in reinforcement learningRIBEIRO, C. H. C.Journal of intelligent & robotic systems. 1998, Vol 21, Num 1, pp 51-71, issn 0921-0296Article

Towards a life-long learning soccer agentKLEINER, Alexander; DIETL, Markus; NEBEL, Bernhard et al.Lecture notes in computer science. 2003, pp 126-134, issn 0302-9743, isbn 3-540-40666-2, 9 p.Conference Paper

Combining exploitation-based and exploration-based approach in reinforcement learningIWATA, Kazunori; ITO, Nobuhiro; YAMAUCHI, Koichiro et al.Lecture notes in computer science. 2000, pp 326-331, issn 0302-9743, isbn 3-540-41450-9Conference Paper

Reinforcement learning : Past, present and futureSUTTON, R. S.Lecture notes in computer science. 1999, pp 195-197, issn 0302-9743, isbn 3-540-65907-2Conference Paper

Finding hidden hierarchy in reinforcement learningPOULTON, Geoff; YING GUO; WEN LU et al.Lecture notes in computer science. 2005, issn 0302-9743, isbn 3-540-28894-5, vol3, 554-561Conference Paper

Relational reinforcement learningDRIESSENS, Kurt.Lecture notes in computer science. 2001, pp 271-280, issn 0302-9743, isbn 3-540-42312-5Conference Paper

Experimental evidence on case-based decision theoryOSSADNIK, Wolfgang; WILMSMANN, Dirk; NIEMANN, Benedikt et al.Theory and decision. 2013, Vol 75, Num 2, pp 211-232, issn 0040-5833, 22 p.Article

Models of trace decay, eligibility for reinforcement, and delay of reinforcement gradients, from exponential to hyperboloidKILLEEN, Peter R.Behavioural processes. 2011, Vol 87, Num 1, pp 57-63, issn 0376-6357, 7 p.Article

Dissociated roles of the anterior cingulate cortex in reward and conflict processing as revealed by the feedback error-related negativity and N200BAKER, Travis E; HOLROYD, Clay B.Biological psychology. 2011, Vol 87, Num 1, pp 25-34, issn 0301-0511, 10 p.Article

On the possibility of learning in reactive environments with arbitrary dependenceRYABKO, Daniil; HUTTER, Marcus.Theoretical computer science. 2008, Vol 405, Num 3, pp 274-284, issn 0304-3975, 11 p.Conference Paper

A two-layered multi-agent reinforcement learning model and algorithm : Information technologyWANG, Ben-Nian; YANG GAO; CHEN, Zhao-Qian et al.Journal of network and computer applications. 2007, Vol 30, Num 4, pp 1366-1376, issn 1084-8045, 11 p.Conference Paper

Physiological and behavioral signatures of reflective exploratory choiceOTTO, A. Ross; KNOX, W. Bradley; MARKMAN, Arthur B et al.Cognitive, affective & behavioral neuroscience (Print). 2014, Vol 14, Num 4, pp 1167-1183, issn 1530-7026, 17 p.Article

Socially embedded cognitionHUEBNER, Bryce.Cognitive systems research (Print). 2013, Num 25-26, pp 13-18, issn 2214-4366, 6 p.Article

Page / 44